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(3) the program was most successful with children who were just 
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explore speech recognition's potential for reading instruction and 
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Talking to the Computer 

A Prototype Speech Recognition System for Early Reading Instruction 



llene Kantrov 



This paper reports on the field tests of one speech recognition system 
for early reading instruction. The software expanded the use of 
speech recognition technology beyond simple drill and practice by 
providing meaningful and appealing contexts in which young chil- 
dren (here, kindergarteners and first-graders) explored an eighteen- 
word vocabulary list. The results of these field tests reveal both the 
particular challenges involved in designing educational programs 
using speech recognition technology, and promising directions for 
future work. 



Homputer software that talks— through speech synthesis or digitized speech- 
has become commonplace (Parham 1988). We now have a variety of talking 
ord processors and programs that issue spoken instructions, as well as programs 
that read stories and speak the dialogue of plays. 

Computer software that listens, on the other hand, still remains quite rare. The 
technological and economic barriers to good quality speech recognition at an 
affordable price remain formidable. A number of products have appeared on the 
market, but none has met with great commercial success, and nearly all have been 
aimed at the business rather than the school market The few speech recognition 
products targeted at schools, 2 such as those produced by Chatterbox Voice Learning 
Systems, focus on drill in reading and math. The Chatterbox Voice Reading Ability 
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Drill software, for example, drills students on 
lists of words (from a total vocabulary of 800 
words), providing feedback for correct and In- 
correct responses. 3 

Yet the educational possibilities of speech rec- 
ognition technology go beyond drill and prac- 
tice. Hopes and expectations for significant 
applications of speech recognition have been 
especially high for reading instruction 
(Blanchard et al. 1987; Strickland et al. 1987). 
This report describes a set of software programs 
for early reading instruction developed In the 
mid-1980s by me and my colleagues at Educa- 
tion Development Center in Newton, Massa- 
chusetts. The software was designed to demon- 
strate some of the capabilities— beyond simple 
drill and practice — of speech recognition, 
combined with speech output, for young chil- 
dren learning to read. We were particularly 
interested in using the technology to help 
children learn to read words in meaningful and 
appealing contexts, as many reading specialists 
advocate. 

Building on the original set of demonstration 
programs, we had hoped to develop a series of 
speech-based reading software that would take 
full advantageof the possibilities of the technol- 
ogy. Because of the immaturity of the technol- 
ogy, however, as well as the volatility In the 
microcomputer hardware and software business 
at the time (1984-85), our demonstration pro- 
grams never reached the market and we never 
got to develop additional software. In the 
1990s, as prices of speech recognition systems 
come down and quality goes up, our experience 
with the technology may prove relevant to 
others beginning to explore speech recognition's 
potential for reading instruction and 
remediation. 4 

The Technology 

Our product used an interactive speech technol- 
ogy system developed by Dragon Systems, Inc. 
The system included speech recognition soft- 
ware (which was integrated with our applica- 
tion software) and a printed circuit board with 
speech output as well as speech input capabili- 
ties. The high-quality speech output, achieved 
using custom-encoded LPC (linear predictive 
coding) speech synthesis, was produced with a 



Texas Instruments chip. The system ran on an 
Apple 11+ or He computer with 64K and one disk 
drive. 

The Dragon system could recognize up to thirty- 
two words at any given time. It was speaker 
dependent— meaning that each speaker had to 
"train" the system to recognize his or her pro- 
nunciation of the words In its vocabulary. And 
It recognized only isolated, not connected, 
speech— meaning that each word had to be 
preceded and followed by a brief silence. 

The great advantage of the Dragon system was 
that, compared to other low-cost speech 
recognizers (and even some not so low-cost 
systems), It had great reliability. That Is, It did 
not make many errors in recognition. Using a 
standard test of Isolated word recognition, the 
system performed with 99.3 percent accuracy. 
Since the standard test was performed under 
optimum conditions, however, one object of 
our project was to determine whether the sys- 
tem was sufficiently reliable when used byyoung 
children in typical school settings to permit its 
use in early reading instruction. 

The Early Reading Software 

We developed and tested four .software compo- 
nents: 

• A training program, which both introduced 
the child to the recognizer's initial eighteen- 
word vocabulary (divided into two chunks 
of ten and eight words) and allowed the 
recognizer to collect the speech samples 
needed to run the other programs. 5 Clever 
graphics and music made the training pro- 
gram fun for children to use. 

• A story program, entitled "Kody's Jungle 
Adventure," which helped the child to read 
the initial vocabulary in a meaningful and 
enjoyable context. 

• A set of three games designed to help the 
child master the reading vocabulary. 

» A program entitled "Your Words," which 
enabled a teacher or parent to enter addi- 
tional vocabularies (of up to sixteen words 
each) which children could then train and 
use to play the games. 

All components provided instructions and 
prompts usingspeech output; even the program 
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menus used speech as well as readily recogniz- 
able icons. Professional artist Jim Carson created 
the programs' visually appealing graphics, and 
original music was composed by a talented 
young musician, Paul Tegels. 

The following sections describe the story pro- 
gram and the games in some more detail. 

Kody'% Jungle Adve iture 

The story program, which we developed with 
the guidance of readlngspedalist Carol Chomsky 
of the Harvard Graduate School of Education, 
was designed to give children practice in read- 
ing words in a meaningful context. The program 
featured a friendly koala named Kody. Speech 
output set the scene— Kcdy was lost In the 
Jungle— and provided simple instructions: The 
child could help Kody find his way home by 
reading the words highlighted on the screen. A 
series of two-part episodes then presented Kody 
with a variety of obstacles (such as an elephant, 
a zebra, a tree, or a flower) and options for action 
(such as riding, tickling, climbing, or jumping). 
By reading the words highlighted on the screen, 
the child got the obstacles to appear and then 
got Kody to perform the actions. 



For instance, the message "Here comes the el- 
ephant/ might appear, with "elephant" high- 
lighted. The program would say, "Here comes 
the." If the child said "elephant," the elephant 
would appear on the screen (see figure 1), ac- 
companied by a musical theme. 

Next, the message "Kody wants to lift it." might 
appear, with "lift" highlighted. The program 
would say, "Kody wants to." When the child 
said "lift," Kody would lift the elephant, and the 
music would play. 

If the child said something other than the word 
called for, the program would suggest that she 
"Try again." If the child stilt could not read the 
word, the graphic of the obstacle or action 
would appear, and she would get another chance 
to read the word. The graphics therefore served 
as prompts as well as rewards for successful 
performance. If the third attempt failed, the 
program would read the entire sentence and 
then proceed. 

At the end of each episode (obstacle plus action), 
a graphic showing Kody's progress through the 
jungle appeared at the top of the screen, and he 
was shown running toward home. After four or 




Figure 1. 
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Figure 2. 



five episodes (depending on the number of 
words trained), he would reach home. Each 
time the child read the story, it would consist of 
a different set of episodes; a total of twenty- 
eight combinations of obstacles and actions 
were available. 

The story program was modeled on the way a 
parent or teacher may read a story to a child. 
The adult will often stop at various points and 
ask the child to read the next word. Of course, 
we had no intention of replacing such adult- 
child interaction; rather, we wanted to give 
children additional practice in recognizing words 
in meaningful contexts. We also expected the 
program to require some adult supervision, es- 
pecially when first used. We felt that adults— or 
older children— should interact with young- 
sters learning to read, at the computer as well as 
away from it. 

As for the story vocabulary, all the words were in 
the kindergarten to second grade range. Because 
our goal was to demonstrate the speech 
recognizer's capabilities, rather than to supple- 
mental particular classroom reading program, 
we made no attempt to correlate the vocabulary 
with standard reading texts. Instead, we aimed 



for a range of difficulty and for words that would 
make the program interesting to children. 

The Games 

The software included three reading games: 

Elephant Game (Rearrange the Words): Words 
appeared In a 3 x 3 grid, with one blank square. 
The child could move any word adjacent to the 
blank square into that square by reading It. The 
object was to move the word In the bottom 
lefthand corner to the top right— thereby let- 
ting the elephant out a door at the top of the 
grid. This game provided a lot of practice In 
reading the words, as most words had to be 
moved more than once. 

Zebra Game (Tic Tac Toe): The child read the 
word in the box he wanted to mark with an X; 
the program played O (see figure 2). When the 
child won, the zebra danced. 

Monkey Game (Concentration): Words ap- 
peared in a 3 x 3 grid. The child could reveal the 
colored rectangles "behind" a word by reading 
it, and then matched pairs of rectangles to 
reveal a piece of a picture of a monkey. When 
all the pairs were matched, the picture of the 
monkey was complete. 
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The games were Independent of the story. Chil- 
dren could play the games with the story vo- 
cabulary or with any other set of words that they 
trained the program to recognize. (These addi- 
tional sets could consist of anywhere from seven 
to sixteen words.) A number of additional 
vocabularies were provided, or teachers and 
parents could use their own word lists. Even 
children who had mastered the story vocabu- 
lary enjoyed practicing their reading skills by 
playing the games, and they could be further 
challenged by more advanced vocabularies. 

Field Testing and Revising the System 

We conducted three field tests of the system 
over a period of about six months with a total of 
forty-two kindergarten and first-grade children 
in two Boston-area public schools. In the first 
two field tests, which took place in Watertown, 
Massachusetts, children were brought in pairs 
to the school gymnasium, where we had set up 
the system. An ndult introduced the program 
and helped the children as needed. An observer 
and video camera operator were also present. 
The third field test, in Newton, Massachusetts, 
took place in the school's computer laboratory, 
a setting more typical of actual school computer 
use. Students again worked in pairs with an 
adult, and the session was likewise observed and 
videotaped. Before each session, the adult ad- 
ministered a pretest to see if the children could 
read the words in the story vocabulary before 
encounteringtheprogram. Children weretested 
on these words again after the session. Partici- 
pants in all three tests inctuded students of low 
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average, and high ability as rated by their 
teachers. 

The children who tried out the system in these 
field tests enjoyed using it, despite some techni- 
cal glitches— especially in the first two trials. 6 
Our field testing showed that the opportunity 
the program provided for the child to control 
what happened on the screen by reading writ's 
aloud was a powerful motivation to learn U 
read. The program proved especially successful 
with younger children (ages four to five) of 
average to high ability who were Just beginning 
to recognize words, and with slightly older 
children (ages six to seven) who were having 
difficulty learning to read. Tables 1-3 (below 
and on page 6) show the mean numbers of 
words learned by children at each of the three 
trials. 

Ironically, the technical problems with recogni- 
tion, which were most severe in our first trial, 
seemed to account for the children learning to 
read more of the words in that trial. That is, 
because children had torepeat the correct words 
more times— both to train the system and to 
read the story and play the games— they had 
more practice with them, which probably ex- 
plains why, according to the pretests and 
posttests we conducted, they learned more 
words. 

After each trial, we worked to improve both the 
reading software and the recognition software. 
In revising the reading software, we focused 
particularly on the nature of the child-machine 
interaction— the dialogue that occurred between 



Table 1. Number of words (out of ten) learned In Trial #1 
Kindergarteners - 10th month of the school year 



Teacher 
Rating 
of Ability 


Mean Age 
Vrs-Mos 


Mean 

Pretest 

Score 


Mean 
Posttest 
Score 


Mean Words 
Learned 
(Range) 


Low (n=5) 


5-7 


0.0 


3.6 


3.5 (2-6) 


Average (n=5) 


6-0 


6.4 


6.4 


6.4 (5-8) 


H«gh (n=7) 


5-8 


7.7 


7.7 


6.6 (4-8) 
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Table 2. Number off words (out of ten) learned In Trial #2 
Kindergarteners - 3rd month of the school year 



Teacher 
Rating 
of Ability 


Mean Age 
Yrs-Mos 


Mean 

Pretest 

Score 


Mean 
Posttest 
Score 


Mean Words 
Learned 
(Range) 


Low (n«l) 


5-2 


0.0 


0.0 


0.0 


Average (n<=2) 


4-2 


0.0 


3.0 


3-0(2-4) 


High (n*3) 


5-3 


1.3 


5.0 


3.7 (2-8) 


Unrated (n=l 


5-4 


0.0 


2.0 


2.0 



First Graders - 3rd month of the school year 



Low (n=2) 


6-6 


0.5 


7.5 


7.0 (6-8) 


Average (n=2) 


6-8 


2.0 


8.5 


6.5 (6-7) 


High (n=4) 


6-6 


8.S 


10.0 


1.5(0-4) 


Table 3. Number of words (out of ten) learned In Trial #3 
Kindergarteners - 4th month of the school year 


Teacher 
Rating 
of Ability 


Mean Age 
Yrs-Mos 


Mean 
Pretest 
Score 


Mean 
Posttest 
Score 


Mean Words 
Learned 
(Range) 


Low (n=3) 


5-4 


2.7 


2.3 


0.0 




Average (n=2) 


5-0 


0.5 


2.5 


2.0(1-3) 


High (n=5) 


5-3 


3.4 


7.4 


4.0 (2-8) 
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the child and the program. Based on the kinds 
and frequency of prompts provided by the adult 
helping the children to use the program in the 
field tests, we added and, in some cases, re- 
moved prompts from the program. For ex- 
ample, we found it was useful for the recognizer 
to tell a child when his or her speech was too 
loud or too soft to be recognized. We then 
noted that we needed at least two different 
prompts for each case (for instance, "Louder, 
please," and "1 can't hear you") to avoid mo- 
notonous repetition. 

In adjusting the recognition software, we fo- 
cused on reducing the frequency of errors, which 
were of three major kinds: rejection (rejecting a 
correct word), substitution (accepting an incor- 
rect word), and insertion (responding to back- 
ground noise, throat clearing, and other non- 
speech sounds). 

Because we couldn't eliminate recognition er- 
rors entirely, we also made efforts to adjust the 
relative frequency of different kinds of errors. 
For instance, we found that most children were 
quite willing to repeat a correct word — several 
times If necessary— if the recognizer didn't ac- 
cept It for some reason. But when the recognizer 
accepted an incorrect word, they were either 
misled or confused. We therefore adjusted the 
software to make it more likely that the program 
would make rejection errors (rejecting correct 
words) than substitution errors (accepting In- 
correct words). 

We also experimented with different kinds of 
microphones. Initially, we used an inexpensive, 
hand-held microphone. However, the children 
tended to play with this microphone, twisting 
the cord around their hands, lifting the micro- 
phone over their heads, and so on. They conse- 
quently did not hold it a uniform distance from 
their mouths, which impaired the effectiveness 
of the recognizer. 

We then put this microphone in a stand, but the 
uniform distance problem wasn't solved, as the 
children bobbed their heads up and down, and 
back and forth. Finally, we settled on a headset 
microphone, which kept the distance from 
mouth to receiver constant. 7 

Overall, the recognizer's rate of accuracy— 
though significantly lower than the 99.3 per- 
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cent achieved under optimal conditions— 
seemed acceptable. In the games, where the 
recognizer had a more difficult task, needing to 
distinguish among several acceptable words at 
any time, the mean number of errors per game 
ranged from 4 to 5.63. In the story, where the 
recognizer only had to identify one correct 
word at any time, the mean number of errors per 
story ranged from 1.75 to 4.60® 

In the second two trials, after we had made the 
adjustment described above, nearly all of the 
errors were rejection errors— that is, the child 
said thecorrect word and the recognizer rejected 
it. Most errors occurred with certain words, 
possibly because of quirks in the recognizer. For 
example, the long vowel sound in the word 
"feed" seemed to cause the recognizer particular 
difficulty. Also, in line with the conventional 
wisdom in the field that SO percent of speech 
recognition errors occur among 20 percent of 
users, a small number of children tended to 
have large numbers of errors, thus skewing the 
mean error data. 

Even at the end of our field tests, we were still 
experimenting with the microphones and ad- 
Justing the speech software and hardware, For 
example, we were fine-tuning the software- 
controlled gain setting, which affects sound 
amplification and therefore the recognizer's re- 
sponses to background versus foreground noise. 
We were also able to use a separate hardware 
adjustment to raise the upper volume level that 
the recognizer would accept. We anticipated 
that ultimately we would be able to reduce the 
recognition error rate significantly. 

Implications for Future Work 

Probably the main lesson from our experience is 
that decisions related to human factors— every- 
thing from the kind of microphone used to the 
kinds of prompts and feedback the software 
provides — are critical in determining the likely 
success or failure of speech recognition with 
children. Moreover, the speech technology 
(both recognition and output) presents chal- 
lenges beyond those that designers of educa- 
tional software typically must deal with. 

For example, in our use of speech input together 
with speech output, we were in effect attempt- 
ing to simulate human dialogue. Some of the 
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kinds of difficulties this entails are illustrated by 
our efforts to design an effective training proce- 
dure. Since the child had to repeat each word 
several times during the training, the program 
somehow had to get the child to keep saying the 
word, but also to pause after each repetition 
(since the recognizer could handle only Isolated 
words). In our initial design, the program 
prompted the child by saying, "Please say snake." 
(or whatever the word was) for each iteration. 
Each repetition was also signaled by a change in 
the accompanying graphic. We expected chil- 
dren to begin to anticipate the prompt and to 
use the graphic due. However, they continued 
to waitfor theprompt, which slowed the training 
process considerably. 

In the second version, we therefore delayed the 
prompt (after the first instance of it), hoping 
that children would take the silence, along with 
the graphic, as a cue to proceed. But children 
continued to wait for the prompt, slowing the 
training down even further, rather than speed- 
ing it up. 

Finally, we added an introductory segment to 
the training that modeled the repetition of 
words and explicitly asked children to attend to 
the moving word on the screen. This explicit 
approach proved effective and eliminated the 
need for repeated verbal prompts. 

In addition to such challenges Introduced by 
the attempt to simulate human dialogue, other 
issues arise from limitations in both speech 
recognition and speech output technologies. 
The need for some adult supervision— at least in 
children's Initial encounters with the system 
and perhaps continuing— is a product of an 
inherent limitation of speech recognition sys 
terns that fall anywhere short of perfect perfor- 
mance. That is, unlike a person, the program 
cannot distinguish between recognizer error 
and child error. It cannot therefore encourage 
a child to repeat a correct word, in cases in 
which the recognizer rejects a correct response. 
Nor can it encourage a child to try another word 
when the error has been the child's rather than 
the recognizer's. The closest we came to being 
able to distinguish between human and 
recognizer error was in the story program, where 
we were able to provide different feedback de- 
ptndlng on whether the recognizer considered 



a response to be dose to the correct word or far 
from It. 

While the main problem with speech recogni- 
tion technology is insufficient discrimination, 
the main problem with speech output technol- 
ogy is inflexibility. We found that children 
varied considerably in their need for, and toler- 
ance of, verba 1 ; prompts. At one extreme, some 
children needed not only the encouragement of 
the computer prom pt to respond to the program 
but also the nodded or spoken approval of the 
adult who worked with the children in all of our 
trials. Other children, by contrast, were ex- 
tremely impatient with prompts and spoke over 
them . But the program provided essentially the 
same quantity and kinds of prompts to all 
children alike. (This is, In fact, another example 
of the way the system inadequately replicated 
human dialogue.) 

Such speech output problems may be amenable 
to technical solutions. Our system, for example, 
could not "listen" for speech while it was speak- 
ing, so a prompt, once begun, could not be 
interrupted. More advanced systems may be 
able to speakand listen simultaneously. We can 
also envision a system that would learn— on the 
basis of a user's typical speed of response — 
whether to prompt more or less. (We did add 
time-delayed repetitions of some of our speech 
prompts that occurred when there were long 
pauses before responses. When the adult present 
was instructed not to provide encouragement, 
these delayed prompts proved effective.) 

As the preceding examples suggest, some of the 
problems we encountered may be Inherent in 
the technology and therefore require instruc- 
tional solutions external to the programs. Others 
likely have technological solutions. Perfecting 
both sorts of solutions requires extensive test- 
ing with users in real settings. 

Despite the difficulties, however, the demon- 
stration programs we produced showed that 
speech recognition technology has the poten- 
tial to become a useful adjunct to early reading 
instruction. Students who used the system in 
our field tests ail responded favorably to It. Even 
those who were accustomed to using computers 
at home or at school were fascinated by a com- 
puter that could both listen and talk. The 
programs' high level of interactivity kept stu- 
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dents engaged, as our videotapes of the testing 
sessions show, even in the early tests when the 
system did not always run smoothly. Though 
the small number of students tested and their 
limited use of the system preclude strong claims 
for student learning, the learning gains we did 
see at least suggest that such a system holds 
promise. 

The positive results we achieved with a primi- 
tive speech recognition system and a computer 
with extremely limited memory and processing 
power underscore the potential of more ad- 
vanced technology. Since the mid- 1980s, when 
we did our work, speech recognition systems 
with much larger vocabularies, continuous 
speech capabilities, and much easier training 
requirements have been developed. But these 
systems are either quite costly or still in the 
laboratory, in fact, most commercially available 
speech recognizers today still have many of the 
same limitations as the system we used. Even 
Dragon Dictate, the costly voice-activated 
"typewriter'' currently marketed by Dragon Sys- 
tems to businesses, though it has a large vo- 
cabulary and the ability to learn a speaker's 
voice through use rather than a separate train- 
ing procedure, cannot handle continuous 
speech. As the technology improves and as 
prices come down, however, our experiment 
points the way toward applications of speech 
recognition that go beyond drilling students on 
reading vocabulary. 

Acknowledgements 

The software this paper describes was a collabo- 
rative effort in which I was Joined by Charles L. 
Thompson, then Director of the Center for 
Learning Technology at EDC; John H. 
McCloskey, whose programming genius made 
it all possible; and a number of people at Dragon 
Systems, who supplied the speech recognizer 
and served as technical resources on a variety of 
issues. They included Janet M. Baker, President 
of Dragon Systems; Mark Sidell, Principal 
Hardware Engineer; and Jed Roberts and David 
Pinto, Research Scientists. David Nelson, Di- 
rector of Media Services at EDC, conducted the 
videotaping of the test sessions. The staff and 
students of the Phillips School In Watertown, 
Massachusetts, the Lincoln-Eliot School in 
Newton, Massachusetts, and the Plowshares 



Childcare Program in Newton, Massachusetts, 
participated In the field tests. The Watertown 
school system kindly loaned us videotaping 
equipment, and the Educational Collaborative 
for Greater Boston provided space for the test- 
ing. The research upon which this paper draws 
was conducted largely by Beth Wilson, with 
Charles L Thompson and Philip P. Zodhiates, as 
part of EDC's subcontract to the Educational 
Technology Center at the Harvard Graduate 
School of Education. For their excellent sug- 
gestions for mproving this paper, 1 would es- 
pecially like to thank Leigh Peake and Cindy 
Char. 



Notes 

1. At the high end, for example, a voice-activated 
"typewriter," Dragon Dictate, with a basic vocabulary 
of 30,000 words, sells for $9,000 and requires an IBM 
PC AT compatible computer with 8 megabytes of 
memory, at an additional cost of $7,000 to $1 1,000 
("New Typewriter'" 1990; Rosenberg 1990). Even a 
far less sophisticated speech recognition system, the 
Voice Navigator from Articulate Systems, which runs 
on the Macintosh and recognizes 200 words at a time, 
sells for $1295. 

2. A small number of speech recognizers marketed to 
schools, and an even smaller number of educational 
applications, areclted by Parham (1988) andStrickland 
eta!. (1987). 

3. The Chatterbox voice input card, which Is avail- 
able for both Apple II series and IBM PC compatible 
computers, sells for about $600; the software, which 
costs $245, also requires an Echo speech synthesizer 
($129 for the Apple version) and a headphone/micro- 
phone ($75). 

4. Much of the information in this paper is drawn 
from the report of a small research study conducted 
by the Educational Technology Center at the Harvard 
Graduate School of Educau which looked at some 
of the reliability and h uman interface issues raised by 
the recognizer (Thompson et al. 1985). 

5. Although the system could actually handle up to 
thirty-two words at one time, we limited the vocabu- 
lary to eighteen words and then split it in two parts 
because of the need to have children train all the 
words used. In fact, we found that young children 
were willing to train ten words at one sitting, but not 
all eighteen. Beyond the initial eighteen-word vo- 
cabulary, however, additional vocabularies could be 
created by teachers or parents and trained and used 
by children. (See the discussion of additional vo- 
cabularies in the section entitled The Games.") 
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6. In addition to problems with the speech recogni- 
tion software and the microphones (described be- 
low), we also experienced some failures of the entire 
system due to static electricity. 

7. However, at the time we stopped working on the 
product we were stilt seeking a headset microphone 
of sufficient quality and sufficiently low price that 
would fit well on child-sized heads. 

8. Mean errors were calculated by adding the total 
number of recognition errors of all types by ail 
children reading the story or playing a game during 
one of the trials and then dividing the total number 
of errors by the number of children who read a story 
or played a game. The figures therefore do not 
accurately reflect the number of errors for any one 
child reading the story or playing a game. Moreover, 
the total number of utterances, which varied widely 
(especially in the games) was not taken into account 
in calculating the mean errors. Selected data, for 
example, show that the total number of utterances 
per game could range from as low as8 to as high as 61 . 
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